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ABSTRACT 

We present composite 3.6 and 4.5/im luminosity functions for cluster galaxies measured from the 
Spitzer Deep, Wide-Field Survey (SDWFS) for 0.3 < z < 2. We compare the evolution of m* for these 
luminosity functions to models for passively evolving stellar populations to constrain the primary epoch 
of star formation in massive cluster galaxies. At low redshifts (z < 1.3) our results agree well with 
models with no mass assembly and passively evolving stellar populations with a luminosity-weighted 
mean formation redshift Zf — 2.4 assuming a Kroupa initial mass function (IMF). We conduct a 
thorough investigation of systematic biases that might influence our results, and estimate systematic 
uncertainites of Azf =lo!i8 ( m °del normalization), Azf =1^05 ( a )' ariC ^ ^ z f = -o.45 (choice of stellar 
population model). For a Salpeter type IMF, the typical formation epoch is thus strongly constrained 
to be z ~ 2 — 3. Higher formation redshifts can only be made consistent with the data if on e 
permits an evolving IMF that is bottom-light at high redshift, as suggested by Ivan Dokkuml (2008). 
At high redshift (z > 1.3) we also witness a statistically significant (> 5a) disagreement between 
the measured luminosity function and the continuation of the passive evolution model from lower 
redshifts. After considering potential systematic biases that might influence our highest redshift data 
points, we interpret the observed deviation as potential evidence for ongoing mass assembly at this 
epoch. 

Subject headings: galaxies: clusters, galaxies: evolution, galaxies: formation, galaxies: luminosity 
function 



1. INTRODUCTION 

Two fundamental aspects of galaxy formation are the 
star formation history and the mass assembly history. In 
principle the star formation history is easier to measure 
because the effects of star formation remain visible even 
long after it has ceased. In contrast, the signatures of 
mass assembly are typically either short-lived or difficult 
to extract once the galaxy returns to virial equilibrium. 
This means that while the star formation history can 
be measured for individual galaxies, the mass assembly 
history is best measured from a population of galaxies, 
and these galaxies must be observed while they are still 
assembling. 

Studies of clusters often treat the galaxies as if they 
formed their stars in one short burst at high redshift, 
and so parameterize their star formation histories by the 
luminosity- weighted or mass- weighted mean age of their 
stellar populations. W hile this is certainl y not true in 
general (see for instance iTrager et al.ll2008T ) it is a useful 
simplification and works surprisingly well, especially for 
massive cluster galaxies. 

A number of different techniques have been used to 
measure the mean age of the stellar populations in galaxy 
clusters. The most straight-forward technique is to use 
spectral indices or model comparisons to galaxy spec- 
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tra to measure ages directly. I Thomas et a l. (2005) study 
early type galaxies in dense environments using such a 
method and find that the majority of the star forma- 
tion in massive galaxies occurred between z ~ 3 — 5, 
with vigorous star formation episodes from z ~ 2 — 5. 
Age estimates for cluster galaxies can also come from 
the cl uster red sequence in various ways. iKurk et all 
(2009) uses the color of the red sequence in a single 
z = 1.6 assem bling clust e r to e stimate a formation epoch 
of Zf ~ 3. iTran et all (|2007t l examine the small scat- 
ter of absorption line galaxies about the red sequence 
of a z=0.83 galaxy cluster, and conclude that all star 
formation ceased by z ~ 1.2 and that most of the mem- 
bers formed by z ~ 2. This is consistent with their own 
estimate of the member ages from the color of the red 
sequence. 

Meanwhile, the fundamental plane can be used to mea- 
sure the ages of stellar populations by observing the 
evolution in M/L ratios as a function of redshift. Per- 
forming such an analysis with three ga l axy c lusters at 
z ~ 0.5, Ivan Dokkum fc van der Marell (|2007t ) estimate 
a luminosity-weighted mean formation redshift for their 
galaxies of 2.OII5 17 • F° r surveys that have many clus- 
ters distributed over a range of redshifts, it is possible 
to examine the color or luminosity evolution of cluster 
galaxies ov er time to constra i n the primary star forma- 
tion epoch. lEisenhardt et al.l (J2008T ) explore a sample of 
clusters with photometric redshifts between < z < 2.4 
and find color evolution that is consistent with Zf > 3 
(with larger formation red shifts preferred for z > 1 clus- 
ters). iMuzzin et al.1 (J2008L hereafter M08) find a cluster 
sample with photometric redshifts between 0.1 < z < 1.3 
by searching for red sequence galaxies, and conclude that 
the luminosity evolution of their sample is consistent 
with passively evolving galaxies formed at z > 1.5. 
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It is important to note that luminosity-weighted mea- 
sures of stellar population ages will generally return 
later formation redshifts (and younger ages) than mass- 
weighted measurements, since young stellar populations 
contribute more luminosity per unit mass than old stel- 
lar po pulations. We note that of the papers quoted 
above, iThomas et al.l (|2005[ ) is the only one that pro- 
vides a mass- weighted age measurement, and so it is 
not surprising that it points to the earliest formation 
redshifts. It is hard to say whether the observed varia- 
tion between the luminosity-weighted age estimates rep- 
resents genuine differences in the stellar populations, dif- 
ferences in the methods, or signs of systematic errors. 
One recognized uncertainty that factors into many of 
these studies is the slope of the IMF (in particular near 
IMq) at high redshift. Since the rate of luminosity evo- 
lution is strongly de pendent upon the slope of the IMF 
(|Conrov et al.l 120091 ) . any of the above studies that esti- 
mate formation ep ochs based on the evolution of galaxy 
luminosities (e.g. Ivan Dokkum fc van der Marell 120071 



IM08I ) are strongly dependent upon the slope of the IMF. 
In general flatter IMFs at earlier epochs would push these 
studies to earlier formation redshifts. 

There is still much work needed on our picture of mass 
assembly in the most massive cluster galaxies. Past work 
on the luminosity evolution in cluster galaxies has typ- 
ically searched for deviations of the galaxies relative to 
models for passively evolving galaxies. Finding none, 
studies have concluded that there is little or no mass as- 
sembly in massive cluster galaxi es, out to the farthest 
redshifts studied , z < 1.3 (see IStrazzullo et al.l 120061 
iDe Propris et al.l 120071 lM08h . However it is not clear 
just how much mass assembly is ruled out by such stud- 
ies. Agreement with passive evolution models represents, 
at best, an upper limit on the amount of mass assembly 
allowed, but this limit has not been explicitly calculated. 
Another method of constraining mass assembly is to look 
at the build-up of the red sequence. The red sequence 
has the advantage of being relatively easy t o ide ntify, 
even at high redshift. Indeed, iKodama et al.l (J2007I ) find 
evidence that the massive end of the red s equence be- 
ins t o build up as early as 2 < z < 3, and IZirm et al.l 
2008) find a red sequence around a z = 2.16 protoclus- 
tcr. This suggests that the bright end of the cluster red 
sequence is established very early, a fact verified by well 



defined red sec 
one at z = 1.6 ( 
(|Papovich et a 



uences found in two assembling clusters, 



Kurk et a l. 2009) an d anot her at z = 1.62 
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In this paper we extend the work of previous re- 
searchers who have studied the evolutio n of the lu minos- 
ity fu nction of massive cluste r galaxies (IS trazzullo et al. 
l200l IPePropris et al.l 120071 iMuzzin et al.ll2007L lM08h . 
We make use of the Spitze r Deep, Wide-Field Survey 
f SDWFS. lAshbv et al.ll2009h . an 8.5 deg 2 survey with a 
limiting magnitude of 18.8 (5a) at 4.5/im (on the Vega- 
mag system) . This survey has sufficient multiwavelcngth 
data to calculate accurate photometric redshifts and reli- 
ably identify clusters and likely members. Because of the 
size of our survey area we have a larger cluster sample 
than previous work in the literature, and the depth of 
our survey allows us to find clusters out to high redshifts 
(0 < z < 2.4). 

This paper is structured as follows: Section 2 describes 
the data we are working with, Section 3 presents our 



luminosity functions, and in Section 4 we compare them 
to models for passively evolving stellar populations. In 
Section 5 we discuss the evidence for mass assembly in 
our results, and Section 6 contains our summary and 
conclusions. Throughout this work we assume a WMAP5 
cosmology » n = 0-279, »a = .721, and H = 70.1 km 
s _1 Mpc _1 (jHinshaw et al.ll2009f) . All magnitudes are on 
the Vega magnitude system. 

2. DATA 

2.1. Galaxy Catalog 

Our galaxy catalog for this work comes from SDWFS, 
which is a reimaging of the IRAC Shallow Cluster Sur- 
vey riSCS. lEisenhardt et alJl200l . The ISCS had 90 
seconds of integration time across 8.5deg 2 in the Bootes 
field, and SDWFS ad ded 3 more 90 second exposures 
at every pointing. lAshbv et al.l (|2009f ) describes the data 
reduction for SDWFS. The source catalog reaches depths 
of 19.77 and 18.83 mags in 3.6 and 4.5/im and contains 
670,446 detected sources at 3.6/im and 528,232 sources 
at 4.5/im (5a, 4" aperture corrected to total). We use 
4" aperture magnitudes corrected to total to derive pho- 
tometric redshift probability distribution functions from 
combined SDWFS (3.6, 4.5, 5.8, and 8.0/im) imaging and 
B W RI data from the NOAO Deep Wide-Field Survey 
(NDWFS). We work with a subsample of the SDWFS 
catalog consisting of sources which are brighter than 
18.8 mags in 4.5/im and have optical data. Our sub- 
sample consists of 454,418 sources for which we assign 
re dshift probability distr ibutions using the methodology 
of iBrodwin et al.l (|2006h . Comp aring to 15,052 galax- 
ies with spectroscopic redshifts, IBrodwin et al.l ([20061 ) 
find that their photometric redshifts are good to Az = 
0.06(1 + z) for 95% of their galaxies at < z < 1.2. 
We find a similar quality using the deeper SDWFS data. 
From simulations we expect our photometric redshifts to 
remain robust out to z ~ 3. Photometric redshift er- 
rors remain as low as cr z /(l + z) = 0.06 out to z = 1.2, 
and increases to a maximum error of cr z /(l + z) ^0.2 
at z = 1.7, dropping back down to er 2 /(l + z) ~ 0.15 
at z = 2.0. This has been verified with spectroscopic 
redshifts out to z < 1.5. 

2.2. Cluster Catalog 

We utilize a samp l e of 335 clusters from the ISCS 
(IStanford et alj|2005t lElston et al.l 120061: IBrodwin et al.l 
120061 : lEisenhardt et al.1 120081: iStern et al.l 12010ft . The 
identification of the c luster sample is described in 
lEisenhardt et al.l (|2008() . and we summarize the major 
features here. This cluster list was gene rated by using 
the re dshift probability distributions from lBrodwin et al.1 
(2006) to perform a wavelet analysis and identify clusters 
in the ISCS. With this method probability maps are gen- 
erated for fixed redshift slices, and "signal" is added to 
the probability maps at the location of the galaxies in 
proportion to the integrated probability of each galaxy 
being found in the redshift slice. Clusters are then iden- 
tified by searching for statistically significant peaks in 
these probability maps. Of the 335 cluste rs in this survey 
25% are confirmed, including 15 at z > 1 (IStanford et al.1 
120051 : lETsenhardt et al.1 12008). We restrict our analysis to 
0.3 < z < 2.0 limiting our sample to 296 clusters of which 
20% are confirmed. Confirmation of low redshift clusters 
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come s primarily from the AGES survey (|Kochanek et al.l 
2004). Since redshift probability distributions were the 
basis for this cluster search, the cluster catalog should 
be relatively robust against assumptions of galaxy type, 
the existence of a red sequence, or any other require- 
ments typically adopted by other cluster finding algo- 
rithms. Finally, we emphasize that this work relies on 
improved photmetric redshifts for our galaxies calculated 
from NDWFS+SDWFS data, though our cluster catalog 
was generated using photometric redshifts from the shal- 
lower NDWFS+ISCS data. 

3. OBSERVED LUMINOSITY FUNCTION 

3.1. General Procedure 

While using photometric redshifts allows us to con- 
struct a large galaxy catalog out to high redshifts, it 
is not without its disadvantages. Without spectroscopic 
redshifts we can only find cluster members in a statistical 
fashion. Instead of measuring the luminosity function of 
cluster galaxies directly, we must measure the luminos- 
ity function of galaxies near each cluster and correct for 
the presence of field galaxies. To accomplish this we first 
find photo-z cluster members and field galaxies for each 
cluster. Galaxies within 1.5 Mpc (physical distance) and 
near the cluster redshift are considered photo-z cluster 
members, and galaxies with 4 Mpc < d < 8 Mpc and 
near the cluster redshift are considered field galaxies. We 
then measure th e field luminosity function by fitting a 
ISchechterl (|1976f) luminosity function to the field galax- 
ies near each cluster. We parameterize the luminosity 
function of the photo-z cluster members as the sum of 
another Schechter (the cluster luminosity function) and 
the field luminosity function. For the actual fitting pro- 
cess we divide the clusters into redshift bins and calculate 
the fc-correction and distance modulus correction needed 
to move a passively evolving galaxy from the cluster red- 
shift (using spectroscopic redshifts for the clusters when 
available) to the bin redshift. We apply these corrections 
to the member and field galaxies for each cluster. We 
then apply the above procedure to the combined photo-z 
cluster members and field galaxies for all the clusters in 
each redshift bin. 

3.2. Fitting Details 

While the above outline describes our general fitting 
procedure, there are a number of important details that 
go into the fitting process. The first consideration is how 
to determine whether or not a particular galaxy is at the 
cluster redshift. For this we use the full redshift probabil- 
ity distribution functions for each galaxy. We calculate 
the probability that each galaxy would be found within 
±0.06(1 +z) of the cluster redshift. Galaxies that fall be- 
low a certain probability threshold are removed from the 
fitting process. We find that requiring a total probability 
> 30% provides a balance between removing field galax- 
ies without cutting too many cluster galaxies. However, 
we also redo our fitting process using cuts at 20, 40, and 
50% to investigate the impact this has on our results (see 
Sections I3~5"1 and l4*. 3. 4|) . Our choice of cut probability has 
only a minor impact on our results. Using a fixed cut at 
all redshifts will remove a larger fraction of galaxies at 
high redshift, because their photometric redshift proba- 
bility distributions are broader. To account for this we 



fix the cut probability at z = and allow it to decrease 
as a function of cluster redshift to account for the broad- 
ening of the photometric redshifts to high redshift. The 
cut decreases such that a galaxy that is at the cluster 
redshift and makes the cut at z — will not be excluded 
if it is at the redshift of a higher-z cluster. Also, this 
probability cut disproportionately removes faint galaxies 
because they have the largest photo-z errors at a given 
redshift. We correct for this effect by performing a sim- 
ple Monte Carlo simulation using the redshift probability 
distributions for the galaxies to estimate the fraction of 
galaxies lost as a function of magnitude, due to this prob- 
ability cut. We then weight the galaxies appropriately in 
our fits to account for this incompleteness. 

We must account for any potential overlap between 
clusters. We search for any clusters that fall within 10 
Mpc and Az = 0.06(1 + z) of a given cluster, and remove 
any photo-z cluster members or field galaxies that are 
within 2 Mpc of a contaminating cluster. We also cal- 
culate the relative size of the field and member regions 
for each cluster, accounting for any missing areas due to 
contaminating clusters, bad pixels, bright stars, or the 
edge of the survey. 

The clusters are divided into redshift bins with a width 
of Az = 0.15 starting at z = 0.3. Instead of considering 
the center of each bin to be the bin location, we use 
the median redshift of the clusters in each bin as the bin 
location. We combine high redshift bins to increase signal 
to noise, so our two highest redshift bins go from z = 
1.35— 1.63 and 2 = 1.63—2.00. We remove approximately 
12% of the clusters in our sample because a neighboring 
cluster is located within 1.5 Mpc and Az — ±0.06(l + z). 
We exclude another 3% because the density of galaxies 
near the cluster is no greater than in the field. Some 
of these are caused by large-scale structures in the field 
region, and the rest are likely to be false positives. Of 
the original 296 clusters we exclude a total of 46 for these 
various reasons, leaving 250 clusters which go into our 
fitting procedure. 

We parameterize the field luminosity function (fLF) 
as a Schechter function but our redshift cut should se- 
lect a population of normally distributed galaxies with 
a = 0.06(1 + z) centered at the cluster redshift. This 
means that the fLF should be similar to a Schechter dis- 
tribution except smoothed with a Gaussian kernel on a 
scale determined by our redshift selection criteria. For 
any individual cluster this smoothing length should be 
close to the total ^-correction and distance modulus cor- 
rection needed to move a galaxy from ±0.06(1 + z) to 
the cluster redshift. So for the fit to the fLF in a redshift 
bin we use a Schechter function smoothed by a Gaussian 
kernel. For each cluster we calculate the fc-correction and 
distance modulus correction needed to go from the clus- 
ter redshift to ±0.06(1 + z). The average of these values 
for the clusters in a redshift bin becomes the scale length 
of the Gaussian used to smooth the fLF. Typical values 
for this smoothing length vary from ~ 0.2 — 0.5 mags, 
and are larger at lower redshifts. Given this smoothing 
length, we fit a smoothed Schechter function to the fLF 
in each redshift bin. We perform a simple \ 2 minimiza- 
tion to the binned fLF, using a binsize of 0.3 mags. In 
addition the contribution of each field galaxy to the lu- 
minosity function is weighted by the ratio of the cluster 
area to field area for the cluster it originated from. We 
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parameterize the Schechter function as: 

N(m) = 0.41n(10)$*10-°- 4(m - m * )(a+1) exp(-10-°- 4(m - m * ) ) 

(1) 
While fitting both the cluster and field LF, we correct 

for photometric incompleteness from the SDWFS survey 
by weighting galaxies according to the mea n incomplete- 
ness measured as a function of magnitude ([Ashbv et al.l 
2009). For reference the completeness remains above 
75% (70%) for 3.6 (4.5)/xm for all our galaxies. For 
measuring both the field and cluster luminosity function, 
fitting is limited to galaxies brighter than the apparent 
magnitude limit of each redshift bin. The apparent mag- 
nitude limit of each bin is taken to be 18.7 (18.8) mags 
in 3.6 (4.5)/im, plus the minimum ^-correction plus dis- 
tance modulus correction applied to any cluster in that 
redshift bin. This correction is a function of redshift and 
varies between about —1.0 mag in the low redshift bins 
to ~ —0.2 mags at the high redshift end. 

We fit for the cluster luminosity function by summing 
a Schechter function and the fLF and fitting this to the 
luminosity function of the photo-z cluster members in 
each redshift bin. We use maximum likelihood fitting as 
an altern ative to a binned fit, f ollowing the general pro- 
cedure of lMarshall et al.l (|1983| ) . Our data are only deep 
enough to reliably constrain a in the lowest redshift bin, 
for which we find a = — 0.6 ± 0.2. Previous w ork in the 
literature has com monly found (|Lin et al.ll2004l hereafter 

L04) or assumed (|M08r i a 0.8 for clusters. This is 

consistent with the value from our lowest redshift bin, 
so we also assume and fix a — —0.8 in our fits for all 
redshift bins. In Section l"4. 3.21 we explore the sensitivity 
of our results to differe nt values of a. W e use a down- 
hill simplex algorithm (jPress et al.l 120071 ) to maximize 
the likelihood as a function of m* and $*. The relation- 
ship between the likelihood, to*, and $* is smooth with 
only one maximum, so our results are independent of the 
starting guess. 

Finally, we estimate errors in to* for each redshift bin 
using bootstrap resampling. We generate realizations of 
each redshift bin by randomly selecting N clusters from 
the bin with replacement, where N is equal to the num- 
ber of clusters in the bin. This number of realizations 
for each bin is such that every redshift bin will have at 
least N\og(N) 2 realizations, which has been shown to 
be a sufficient numb er for accurately estimating errors 
(|Babu fc S ingh 1983). We fit each realization in the same 
fashion as before. Error estimates come from the distri- 
bution of fitted ?7i* values to the realizations. We find 
the to* values that contain 68% of the fitted distribution, 
and use these as our errors. 

3.3. Statistical Background Subtraction 

We also measure the cluster LF using a more typical 
statistical background subtraction to verify that there 
are no large systematic errors in our results. We use 
the same cluster and field regions as before and create 
binned cluster and field LFs for each cluster, using a 
binsize of 0.2 mags. The fLF is then subtracted from 
the cluster luminosity function. We bin the clusters in 
redshift space, this time with a binsize of Az = 0.2, and 
apply fc-corrections and distance modulus corrections to 
move the LFs from the cluster redshift to the center of 



each bin. The field subtracted LFs are then added to- 
gether in a redshift bin, and a Schechter function is fit 
using Levenberg-Marquardt least-squares minimization. 
Errors are derived with bootstrap resampling. 

3.4. Results 

The fits to the cluster luminosity functions for each red- 
shift bin are shown in Figures Q] (3. 6/im) and[2](4.5/im). 
For clarity we plot in Figures [1] and [2] the (binned) dif- 
ference between the luminosity function of galaxies near 
the cluster and the field luminosity function in units of 
galaxies per magnitude. This is represented by solid cir- 
cles. The solid curve denotes the Schechter function 
with a = —0.8 fitted to the cluster luminosity func- 
tion, and the solid vertical line shows the fitted value 
of to* . The dotted line shows the fitted Schechter func- 
tion with a = —1.0, and the dashed vertical line is the 
apparent magnitude limit of the redshift bin. Figures Q] 
and [5] show that our fitted values of m* are ~2.5 mags 
brighter than the apparent magnitude limit of the sur- 
vey at low redshift. At high redshift the 4.5/xm data 
are ~1 mag brighter than the apparent magnitude limit, 
while the 3. 6/im data are only 0.5 mags brighter. Ta- 
ble Q] lists the best fitting m* values and errors for 
a = —0.6,-0.8,-1.0, and —1.12 as a function of red- 
shift. Table [5] lists the results calculated with our statis- 
tical background subtraction. 

3.5. Fitting Errors 

There are multiple potential systematic uncertainties 
related to the LF fitting procedure. The first comes 
from our choice of probability cut for determining which 
galaxies are at the cluster redshift. As mentioned in Sec- 
tion [221 we reject galaxies which have a total probability 
< 30% of being within Az = 0.06(1 + z) of the cluster 
redshift. Using a probability cut of 20% moves our re- 
sults brighter by ~ 0.05 mags, and cutting at 40% moves 
our results fainter by ~ 0.05 mags. A cut of 50% scatters 
our results randomly by ~ 0.1 mags. 

Second is the luminosity function completeness correc- 
tion which we have applied to account for the loss of faint 
galaxies due to our probability cut. If we do not include 
this correction, our fitted m* values move systematically 
brighter by ~ 0.05 mags. 

Third, our choice of a is an important source of fitting 
uncertainty in this study. For now we note that adjust- 
ing a produces a largely systematic shift in our results. 
In general if a becomes steeper by 0.1 then our fitted 
m* values move brighter by ~ 0.1 mags. We discuss the 
impact of our choice of a in more detail in Section 14.3.21 
because comparison to our models for passive galaxy evo- 
lution provides the best way to investigate the impact of 
a on our results. 

4. MODELS FOR PASSIVELY EVOLVING 
STELLAR POPULATIONS 

4.1. Model Description 

To measure the primary star formation epoch for 
galaxies in this work we compare our measured m* 
values against model predictions for a passively evolv- 
ing stellar population. We comp are the predictions 
for th ree different stellar model sets, iBruzual fc C hariot 
(|2003[ hereafter BC03), an updated version of |BC03 
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Fig. 1. — Schcchtcr fits to the 3.6/im cluster luminosity function in each redshift bin. The solid circles arc binned differences between the 
luminosity function for all galaxies within 1.5 Mpc of the cluster center at the cluster redshift and the area- weighted luminosity function of 
the field in units of galaxies per magnitude per cluster. The solid curve shows the best fitting Schcchter function and the solid vertical line 
shows the fitted value of m^ g assuming a = —0.8 in all redshift bins. The dotted curve shows the fit with a = —1.0. The dashed vertical 
line denotes the apparent magnitude limit in each bin. 
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Fig. 2. — The same as Figurc[T] but for 4.5/^m. 
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TABLE 1 

schechter flt paramaters 
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TABLE 2 

schechter flt paramaters 

Statistical Background Subtraction 



a 


z 


# Clusters 


^3.6 


Mis 


-0.80 


0.55 


43 


16.67 ± 0.07 


16.57 ± 0.11 


-0.80 


0.79 


35 


16.91 ± 0.10 


16.80 ± 0.08 


-0.80 


0.98 


37 


16.99 ± 0.09 


16.79 ± 0.09 


-0.80 


1.20 


27 


17.22 ± 0.17 


16.87 ± 0.19 


-0.80 


1.41 


17 


17.67 ± 0.22 


17.05 ± 0.21 


-0.80 


1.60 


13 


17.82 ± 0.23 


17.30 ± 0.14 


-0.80 


1.77 


3 


16.91 ± 0.57 


16.41 ± 0.68 




Fi g. 3. — A comparison b etwee n the models of |BC03 (dotted 
line), CB07 (dashed line), and M05 (solid line). The star formation 
epoch is assumed to be 2 = 2.5 for all models. Redshift is on the x- 
axis. Top panel is 3.6/xm, bottom panel is 45mn. All three model 
sets are normalized to match the results of lL04l at low redshift. 

with detailed treatment of thermally- pulsing asymp- 
totic giant branch (TP -AGB) stars (Ch ariot fc Bruzual 
Iprivate com munication, hereafter CB07), and the mod- 
els from lMarastonl (J2005L hereafter M05). Figure [3] com- 
pares the predictions of these three model sets at 3.6/xm 
(top), and 4.5/im (bottom). In this figure all the models 
are normalized to have the same M* value at z — 0.05 
in 3.6u m ( M* R ), which we discus s in mo re detail b elow. 
ICB07I and lBC03l use a iChabrierl (l_003h IMF and IMOU 
uses a Kroupa IMF. For all three model sets we assume 
solar metallicity, and in Section f4. 3. 31 we investigate the 
results of changing IMF or metallicity. We characterize 
the star formation history as an exponentially declining 
burst of star formation with a characteristic timescale of 
0.1 Gyr, effectively a single burst model. Since our goal 
is to compare our results against models for passively 
evolving stellar populations, we concentrate on this one 
"bursting" model of star formation and do not attempt 
to model more complicated star formation histories. We 
compare all three model sets in Figure[3l which shows the 
predicted evolution in3J3um w hen st ar formation turns 
on at z = 2.5. The |BC03j and [CB07J models agree well 
at l ow red shift, but start to diverge at higher redshifts. 
The lCB07l models are brighter than the BC03 models at 
higher redshift, due primarily to a m ore im proved treat- 
ment of TP-AGB stars in the [CBQU models. IM05I . which 
also includes an upda ted treatment of TP-AGB stars, 
is brighter than ICB0 7 at z ~ 0.6 , and lacks the dip at 
z ~ 1.8 that both CB07 and lBC03lhav e. For the remain- 
der of this paper we focus upon IM05I but also compare 
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Fig. 4. — Measured evolutio n of m % 6 (top) and m\ 5 (bottom) 
versus redshift compared with [M05 model predictions. We show 
our results from both our photo-z selection method and statistical 
background subtraction. The models are normalized to match the 
results of L OJ a t low redshift. Also included in the top plot are the 
results oflMOS 



our results with IBC03I and ICB07I (Section |4~373)) to de- 
termine the systematic errors introduced by our choice 
of model. 

With all three model sets we build a range of models 
which differ only by their peak star formation epoch. We 
use peak star formation redshifts that span the range of 
Zf = 1.0 to Zf = 9.95 in redshift increments of Az = 
0.05. Previous studies have used measurements of the 
galaxy luminosity function at low redshift to normalize 
such models out to high redshift. However our data are 
of sufficient quality that we can fit for both the model 
normalization and formation redshift at the same time, 
allowing for a more self-consistent result. For all of our 
model fits we will let the model normalization be a free 
parameter. However in some cases (such as Figure [3J 
we show models without having performed a fit to the 
data. For simplicity in these cases, we fix the model 
normalization using the results from IL04J for which we 
calculate M£ 6 = —24.32 for galaxies at a mean redshift 
of z = 0. 05. T his comes from taking the original result 
found by Ell in Ks band {M* Ks = -24.02) and using 
our models to convert from rest-frame Ks to observed 
3.6/im. 

4.2. Model Comparison to Cluster LF 

The measured m* values from our photo-z selection 
method with a fixed at —0.8 are plotted against redshift 
in Figure |4j A small subsample of the passive evolution 
models are overplotted (with the normalization fix ed ac- 
cording to IL04J ) . Figure 2] includes the results of lM08l 
Our error bars are smaller as expected because of our 
use of photometric redshifts to select galaxies (as op- 
posed to the statistical background subtraction used by 
M08), and because of t he la rger area and greater depth 
of the SDWFS survey. MP S] used d ata from the Spitzer 
First Look Survey (|Lacv et ; alJ2005[ ). a 3.8 deg 2 field with 
l/6th of our integration time. In Figure|3]we also include 
the results of our statistical background subtraction and 
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find excellent agreement between our photo-z selection 
method and our statistical background subtraction. For 
this reason we will concentrate on the results from our 
photo-z selection method and only return to our statis- 
tical background subtraction when investigating the im- 
pact of systematic uncertainties fSection l4.3.4j) . Figure[S] 
compares our results with the predicted color evolution. 
The color evolution is not useful for constraining the for- 
mation epoch because there is little difference between 
the models. Instead we note that the color evolution 
we observe is consistent with the model predictions and 
hence serves as a useful check of our results. 

To measure the actual formation redshift we perform a 
simple x 2 minimization between the data and the mod- 
els, allowing the model normalization to float. The best 
fit model has Zf = 9.95, which is the earliest forma- 
tion redshift we attempted to fit. xt f° r this best fit 
is 4.63, which excludes the best fitting model for pas- 
sive evolution at > 5er. The reason for this discrepancy 
is illustrated by comparing our observed evolution with 
the best fitting model, shown in Figure [6] The top left 
and top right plots show the observed evolution and best 
fitting model with Zf = 9.95 for 3.6 and 4.5/im, respec- 
tively. The lower left and lower right panels show the 
residuals of the fit. The large \ 2 ls clearly caused be- 
cause the z > 1.4 data points are faint relative to the 
rest of the data, and so the models miss both the low 
redshift (z < 1.3) and high redshift (z > 1.3) data. A 
close inspection of Figure @] shows that no combination 
of formation redshift and model normalization can match 
both the low redshift and high redshift data. 

For this reason we redo the \ 2 fit without the two 
highest redshift data points. The resulting fit is plot- 
ted in Figure [7] The preferred formation redshift is now 
Zf = 2.4_q'j|. The quality of the fit is substantially im- 
proved (xt = 1-5)- An analysis of x 2 residuals reveals 
that both the high redshift 3.6/im data and the 4.5/im 
data reject the best fitting Zf — 2.40 model at very high 
confidence (> 5a). There are two plausible reasons for 
this discrepancy at high redshifts. First, there is always 
the possibility that some systematic error becomes im- 
portant at high redshifts. Second is that this is a signa- 
ture of galaxy assembly, which would cause M* to grow 
fainter at higher redshifts as the galaxies break up into 
their progenitors. We explore both of these possibilities 
more in Section [572] Either way we will exclude the high- 
est redshift data points from all further analysis involving 
our models for passive stellar evolution. 
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Fig. 6. — Best fitting model using the entire cluster sample out to 
z = 1.86. Redshift is plotted versus 7713 6 (top left) and m\ 5 (top 
right). The solid line is the best fitting model for passively evolving 
stellar populations with zt = 9.95. The models are normalized to 
have M| 6 = —24.71 at z = 0.05, as determined from the fitting 
process. The circles correspond to the measured evolution of m» 
with a fixed to —0.8. Differences between the data and the models 
are plotted for 3.6/im (bottom left) and 4.5/im (bottom right). 

4.3. Systematic Uncertainties 

There are a number of systematic uncertainties that 
affect our ability to accurately measure the formation 
redshift. We explore these in detail below. 

4.3.1. Model Normalization (M^ e ) 

The most obvious source of systematic bias is model 
normalization. As discussed above, we let the model 
normalization remain a free parameter in all of our fits, 
allowing us to quantify the error introduced by an un- 
certain model normalization. We can instead use pub- 
lished measurements of the galaxy luminosity function 
at low redshift as an additional check of our results. To 
allow for more direct comparisons we parameterize our 
fitted model normalizations by the predicted M£ e value 
at z = 0.05. We have calculated the same value for IL04L 
allowing us to compare directly to their results so long 
as we use the same value of a. We have also searched 
the literature for other measurements of the galaxy lumi- 
nosity function at low redshift. In Table [3] we list results 
fromlBellet al.l(l2003h . [L04, Ba bbedge et al.l (|2006() . and 
iDai et all (|2009l hereafter D09) which cover a range of 
values of a, a range of redshifts, and use both 3.6/im and 
ifs-band. We use our models to convert from ifs-band 
to 3.6/xm as necessary, to correct for the predicted pas- 
sive evolution from the survey redshift to z = 0.05, and 
to move the results from restframe 3.6/zm to observed 
3.6/jm. We therefore calculate observed M| 6 at z = 0.05 
for all the papers in Table [3l allowing us to compare more 
directly to other results. 

We begin the comparison of our fitted model normal- 
izations to the literature by looking at the fits to our 
a = —1.12 data. While this is not our fiduc ial value 
for a (—0.8) we begin here because both ID09I and IL04I 
have res ults for a ~ —1.12, and because the differ ences 
between ID09I . IL04I . and our work are instructive. ID09I 
looked at field galaxies in the ISCS. Our survey, SD- 
WFS, is a reimaging of the ISCS. |L04| ex amine d cluster 
galaxies in Ks band. So while the galaxies |L04| examines 



Fig. 7. — Same as Figure [6] except the two highest redshift bins 
(open circles) have been excluded from the fit. The new best fit 



model has z 



f 



-24.47. 



are in a si milar e nvironment to the galaxies we study, the 
galaxies in ID09I were observed at the same wavelengths, 
with the same instrument, and some of them are even 
the same galaxies as ours. Given these facts we illustrate 
in Figure|8]the impact of the model normalization on our 
results. In this Figure we show 1,2, and 3<r confidence 
regions in M£ 6 versus Zf space, with the best fit to the 
low redshift data (z < 1.3, z f = 2.80, M£ 6 = -24.91) 
marked with a filled circle. There are two sets of solid 
and dashed vertical lines, rep resen ting the p redicted val- 
ues of M 3 * 6 at z = 0.05 for EqI and ID09I (see Table [3] 
for the precise values). Clearly our model nor maliz ation 
is in excellent agreement with pre dicti ons for ID09I . and 
it is syste matically brighter than IL04L The agreement 
with D09 is encouraging and, since they used the same 
instrument and looked at the same field, not surprising. 

Similar to Figure El we show in Figure |9] the 1,2, and 
3cr confidence regions in MJ 6 versus Zf space for the fit 
to our results with a — —0.8, for which we find zj — 
2.40 and Mg 6 = - 24.47. We also show the predicted 
M 3 * 6 val ue for |L04J for th eir fit with a = -0.84. We 
note that lBell et al.l (|2003T ) also me asured M* i n the l ocal 
universe finding a = —0.77. For iBell et all (J2003I ) we 
calc ul ate M-* K = —24.3 2, the same as what we find for 
|L04 IBell et all (|2003j) examined galaxies in the same 
band as lL04l (Ks), so the studies with Ks data in Table 
[3] are internally consistent. Similar to our results for t he 
a = —1.12 data, the calculated M^ 6 value for |L04| is 
fainter than our fit, this time by 0.15 mags. We note 
that because we have fit for the model normalization our 
formation redshift is robust against any systematic errors 
in ?7i*. Thus, even if this disagreement means that there 
is some unaccounted for systematic errors in m* , our best 
fitting formation redshift will remain unchanged. On the 
other hand, if we had fixed our model normalization to 
match L04 this would have artificially forced our fit to 
later formation redshifts. 

Figure |9] demonstrates that we can place tight con- 
straints on the formation redshift even though we have 
left the model normalization as a free parameter. We 
constrain the star formation epoch to be within 2.0 < 
Zf < 3.0 at 3<7 given our choice of model set and a. 
For comparison we include Figure [10] which shows the 



10 



Mancone, Gonzalez, Brodwin, Stanford, Eisenhardt, Stern, Jones 



TABLE 3 
Model normalizations from the literature. 



Reference 



Band 



Al* 



0.05 



D09 


3.6/^m 


0.235 


-1.12 


-25.06 ± 0.18 


-24.99 


Babbedee et al. (2006) x 


3.6/im 


0.13 


-0.9 


-24.67 ± 0.10 


-24.72 


Babbedge et al. (2006) L 


3.6/im 


0.38 


-1.0 


-25.07 ± 0.10 


-24.85 


L04 2 


Ks 


0.043 


-0.84 


-24.02 ± 0.02 


-24.32 


L04 2 


Ks 


0.043 


-1.1 


-24.34 ± 0.01 


-24.64 


Bell et al. (2003)^ 


Ks 


0.08 


-0.77 


-24.06 


-24.32 



"Schecht er pa rameters were not listed in the original paper, but instead come from 
Table 5 of lDOU 
'L04 both fit for a finding a = — 0.84 and fixed a = 
c No error estimate is available for Bell et al. (2003). 



-1.1. 



constraints on model normalization and formation red- 
shift that result from our statistical background subtrac- 
tion (Section I3.3|) . The best fitting formation redshift, 
Zf = 2.10^0 ioi is consistent with the result from our 
photo-z selection method. A comparison of Figures H] and 
[TU] shows that while the la confidence region for the sta- 
tistical background subtraction is actually smaller than 
for our photo-z selection method, the photo-z selection 
process places tighter constraints overall. We also note 
that, unlike the photo-z selection method, the statisti- 
cal backgrou nd su btraction does in fact match the nor- 
malization of lL04 This difference is caused because the 
statistical background subtraction is missing the lowest 
redshift data point, which places the best constraints on 
the model normalization. Therefore, the statistical back- 
ground subtraction can have a different best fitting model 
normalization, despite the fact that there is good agree- 
ment between the results of our statistical background 
subtraction and photo-z selection method (see Figure @|. 

4.3.2. Faint End Slope (a) 

Since we have assumed a value of a and fixed it in our 
fits, it is important to examine the impact of a on our 
results. This is particularly important because there is 
a strong correlation between to* and a. We have fit our 
data using a variety of values for a, —0.6, —0.8, —1.0, and 
— 1.12. This covers the full range of values for a listed 
in Table [3] and commonly found in the literature. As 
can be seen in Table Q] the behavior of to* as a function 
of a is well behaved and smooth. In general a change 
in a of 0.1 corresponds to a systematic change in to* of 
~ 0.1 mags. The change in to* is largely constant as 
a function of redshift, and so the observed shape of the 
evolution in the luminosity function changes only slightly 
as a function of a. 

We fit models to our results for —0.6 < a < —1.12, 
and list the results in Table |U The most important fact 
is that the best fitting formation redshift is fairly robust 
against changes of a. Changing a has only a small impact 
on the observed evolution of m* , and as a result changing 
a primarily changes the fitted model normalization. The 
best fitting formation redshift varies from Zf — 2.80 for 
a = —1.12 to Zf = 2.40 for a = —0.6 while at the same 
time our fitted to* values systematically shift by ~ 0.6 
mags over this same range of a. 

We already mentioned in Section 14.31 that our fitted 
mod el nor malization for a = —1.1 2 is in good agreement 
with lD09l but disagrees with lL04 We note now that we 



also have good agreement with iBabbedge et al.l (|2006|) . 
For their results we calculate M£ 6 = —24.72 ± 0.1 and 
M 3 * 6 = -24.85 ± 0.1 for their z = 0.13 and z = 0.38 



data points, which have a = —0.9 and a 



-1.0. For 



a = -1.0 we find M| 6 = -24.75 ± 0.04, which is within 
the errors for both. This means that our fitt ed m odel 
normalizations agree with both 3.6/im surveys ()D09l and 
IBabbedge et al . 2006) but disagree with both Ks surveys 
(|L04| and iBell et al.ll2003l) over a range of values of a. 

We use the fitted formation redshifts across the range 
of a studied in this paper as an estimate for the sys- 
tematic error introduced by a. Therefore we find that 
uncertainties in a introduce an uncertainty into our best 
fitting formation redshift of Azf =t.o '05 ■ 

Finally we note that the necessity of fixing a in all red- 
shift bins introduces an important limitation - we clearly 
cannot measure any evolution in a. Moreover, the cou- 
pling between to* and a means that if there is any evo- 
lution in a then our fitted values for to* will not agree 
with what would be measured using the proper value of 
a. However this has only a small impact on our ability to 
measure the formation redshift for massive galaxies. As 
long as we choose a consistent value for a at all redshifts 
we can accurately measure any relative change in the 
bright end of the LF as a function of redshift, and there- 
fore we can still properly measure the formation redshift 
of massive galaxies. 

4.3.3. Stellar Population Synthesis Modeling (SPS) 

Our choice of SPS model set is clearly an important in- 
gredient in our analysis. To assess the sensitivity of our 
results to this ingr edient we r edo ou r y minim ization 

using the models of lBC03l and[CB07 : . For lBCOjLw c find 
7 , — 1 qc;+ - 21 M* — —94 K7+O.04 

-,+0.19 71 f* 
-0.19' JW 3.6 



and for CB07 we 



find Zf = 2.70+£^, M 3 * 6 = -24.57 ± 0.04 (compared to 
our fiducial fit of z f = 2.40+°;^, M 3 * 6 = -24.47 ± 0.04). 
These results are also summarized in Table [S] where we 
list both best fitting formation redshift, model normal- 
ization, and xt f° r an three model sets. We use the dif- 
ferences between these results to estimate the systematic 
error introduced by our choice of model (Azf =^0.45)- 

We also investigate the effect of common sources of 
systematic biases in the models, such as the parameter- 
ization of the IMF and metallicity. If we use a Salpetcr 
IMF instead of Kroupa the best fitting formation redshift 
moves to Zf = 2.15. Using both a super-solar and sub- 
solar metallicity moves our results to earlier formation 
redshifts (zf = 2.8 and Zf = 3.00, respectively). These 
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Fig. 8. — Confidence regions for the formation epoch versus 
model normalization excluding the two highest redshift bins, using 
our results with a = —1.12. The solid circle denotes the best fit 
of M| 6 = -24.91, Zf = 2.80. The contours represent 1,2 and 3<r 
confidence intervals for our data set. The solid vertical and dotted 
lines represent the normalization and errors from L04J and |P09l (see 
Table [3] for the precise values). 



results are also summariz e d in T able El 

Recently IConrov et al.l (|2009f) performed a thorough 
investigation of potential systematic uncertainties in SPS 
modeling. For our purposes the most important uncer- 
tainty they found was the logarithmic slope of the IMF 
around 1M Q . For old stellar populations the mass of the 
main sequence turn off (MSTO) is about 1M©, and the 
slope of the IMF at the MSTO is an important factor for 
determining the evolution of the luminosity of a stellar 
population. IConrov et al.l ()2009l ) estimates the impact 
that the uncertainty in slope of the IMF near 1M© has 
on SPS modeling and concludes that the level of sys- 
tematic bias in the predicted evolution of the luminosity 
function in the current generation of stellar models could 
be as large as ~ 0.4 mags per unit redshift. To account 
for this we add to our models a simple prescription (Con- 
roy, private communication) to describe the relationship 
between the slope of the IMF at 1M© and the evolution 
of the models (dmag/dz) in 3.6 and 4.5/iin, and fit for 
Zf as a function of the IMF slope. For the fit we use 
our data with a fixed at —0.8 and fix the model nor- 
malization to A/g 6 = —24.47. The result is shown in 
Figure fTTI which plots 1,2, and 3er confidence intervals 
around the best fitting combination of IMF slope and 
Zf. A Salpeter slope in these units is 2.35. Clearly there 
is a strong degeneracy between the slope of the IMF and 
the best fitting formation redshift. If the slope of the 
IMF near 1M© is flatter at high redshift, then the actual 
formation redshift can be substantially earlier. 

In this contex t it is important to consider the work 
of Ivan Dokkuml (|2008h who examined the evolution of 
both the color and mass-to-light ratio of cluster galaxies. 
Because the luminosity evolution of galaxies is strongly 
dependent upon the IMF and the color evolution is not, 
Ivan Dokkuml ([2008) was able to place joint constraints 
on the IMF and the formation redshift of the cluster- 
galaxies in his sample. Looking at just the evolution of 
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Fig. 9. — Same as Figure \E\ except we have fitted our data with 
a = -0.8. The solid circle denotes the best fit of M| 6 = -24.47, 
Zf = 2.40. The vertical lines represent the normalization and error 
from ILin et al.l 11200? ) with -24.32 ± 0.02 and a = -0.84. Our 
fitted model normalization is consistent with the results from D09, 
but disagrees with L04. 

the mass-to-light ratio and assuming a Salpeter IMF he 
found good agreement to models with Zf = 2.0, simi- 
lar to what we have found from the luminosity evolution 
of our sample. However this same model could not fit 
the color evolution in his sample for which an earlier for- 
matio n redshift, Zf — 6.0, was necessary. Ivan Dokkuml 
(2008) found that the color and mass-to- light evolution 
could only be simultaneously fit by a flat IMF and an 
early formation redshift (zf = 6.0). They conclude that 
the slope of the IMF at high redshift is much flatter than 
a Salpeter IMF, which if true means that our fitted for- 
mation redshift (zf = 2.4) is really only a lower limit 
on the format i on red shift, as per Figure [TT] As noted in 
IConrov et al.l (|2009t ). this degeneracy between the IMF 
and the formation redshift is common to all studies that 
rely on the luminosity evolution of a stellar population. 
Clearly then the slope of the IMF at high redshift is the 
most important systematic uncertainty in this study, as 
well as similar ones in the past. With a sufficiently flat 
IMF, our results can fit arbitrarily early formation red- 
shifts. 

The other major uncertainties discussed in this paper 
introduce relatively small systematic errors: Azf — "'■'■'' 

(model normalization), Azf =_o'.o5 ( a ); an d Az/ 
(model set). Added in quadrature this gives a total er- 
ror of Azj — ±0.5. While our error bars leave room to 
fit earlier formation redshifts, it is clear that a flat IMF 
can lead to even earlier formation redshifts. This could 
account for any apparent disagreement between our re- 
sults and the work of Eisenhardt et al. (2008) who looked 
at the color evolution of this same sample of clusters. 
lEisenhardt et all (|2008f l find z f = 3 with a trend to- 
wards earlier formation redshifts for higher redshift clus- 
ters (z > 1). If the IMF is flatter at higher redshifts, 
then the color evolution of the clusters should point to 
an earlier for mation redshift than the luminosity evolu- 
tion. As with Ivan Dokkuml (J2008) we can use this fact to 
place joint constraints on the formation redshift and the 
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Fig. 10. — Same as Figure [9] except the models have been fit to 
our results using a statistical background subtraction. The solid 
circle denotes the best fit of MX , 



-24.33, Zf = 2.10. 

IMF by combining both the luminosity and color evolu- 
tion of our clusters. This will be an important goal for 
our future work. 

4.3.4. Fitting Errors 

Finally we study in detail the impact of the systematic 
biases introduced into our fitting process which were dis- 
cussed in Section 13.51 We apply the same x 2 minimiza- 
tion proc ess to each variation of our fitting procedure 
using the IM05I models with a fixed at —0.8 and calcu- 
late the best fitting formation redshifts, model normal- 
izations, and errors. We list the results in Table S] along 
with xt f° r each fit. Figures [12] and [13] directly com- 
pare all our systematic biases graphically by showing the 
results from Tables H] and \5\ We plot the best fitting 
formation redshift and errors for both fitting systemat- 
ics and model systematics in Figure [12] The best fitting 
normalizations and errors are shown in Figure IT51 Con- 
sidering the variety of systematic uncertainties that we 
have investigated, the general agreement found in Figure 
[12] was not a foregone conclusion. All of our best fits fall 
in the range of 1.9 < Zf < 3.0. 

Could the good agreement between the various sys- 
tematic biases be an artifact of our choice to let the nor- 
malization float? Any signs of a discrepancy might show 
up in the model normalization instead of the formation 
redshift. However Figure [131 which compares the fit- 
ted model normalizations in the same way as Figure 1121 
demonstrates that this is not the case. The largest out- 
liers correspond to changes in a. As discussed in Section 
14.3.21 we expect large changes in the normalization for 
different values of a because of the strong correlation 
between a and m*. Our consistency check with results 
from the literature (see Sections 14.3.11 and 14.3.21) shows 
that these normalizations are consistent with their ex- 
pected values from other 3.6/im studies, and so there is 
no evidence of a problem. 

Overall the consistency found in Figures [12] and [13] is 
quite remarkable, and is a testimony to the robustness of 
our results and the quality of our data. At this point the 
most important systematic uncertainties are the slope of 




Logarithmic slope of IMF 

Fig. 11. — 1, 2, and 3<r confidence regions for the formation epoch 
versus the logarithmic slope of the IMF. excluding the two highest 
redshift bins. For these fits we have used the data with a fixed to 



-0.8 and have fixed the model normalization to M. 



:-!.(; 



-24.47, 



taken from our fiducial fit to the data. In these units a Salpeter 
slope is 2.35. 

the IMF near 1 M and our assumed value of a. Future 
work which desires to build upon our results should con- 
centrate on constraining a and the IMF at high redshift. 

5. MASS ASSEMBLY 
5.1. Low Redshift 

We also examine the impact of ongoing mass assembly 
in our low redshift bins. While our results are well fit 
by models for passively evolving stellar populations, this 
does not rule out the possibility of galaxy assembly. To 
test for the presence of mass assembly in our low redshift 
galaxies, we create a simple model for luminosity evolu- 
tion in which the same models for passive evolution are 
used but an additional dimming factor of (1 + z) 7 is in- 
troduced to account for the breakup of galaxies at higher 
redshift. We fit to the z < 1.3 redshift bins, use the 
data with a = —0.8, and fix the model normalization to 
Mg 6 = —24.47. We find the best fitting combination of 
formation redshift and 7(z/ = 2.40,7 = 0) and show the 
1, 2, and 3er contours for this fit in Figure ITU Our best fit 
demonstrates clearly that we have no need for additional 
mass assembly ( at least for this toy model) , in agreement 
with past work (jStrazzullo et al.l [20061 iDe Propris et al.l 
l2007f ). However it is also clear that our results can fit a 
small amount of mass assembly. 7 ~ 0.2 is consistent at 
la, and 7 ~ 0.35 is consistent at 3o\ These correspond to 
~ 90% and ~ 75% of the final galaxy mass being assem- 
bled by z = 1.0. Considering that we have constructed 
only a very simple toy model for mass assembly at low 
redshifts, these numbers should not be considered au- 
thoritative. However this should make it clear that there 
is room for growth of massive galaxies at low redshift, 
albeit not much. 

Finally we note that the presence of mass assembly at 
low redshift has an impact on the best fit formation red- 
shift. Since mass assembly causes the galaxies to grow 
fainter out to higher redshift, accounting for mass assem- 
bly causes the models to grow steeper and therefore fits a 
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TABLE 4 
Impact of Fitting Systematics. 



Systematic Uncertainty 



-/ 



M 



:s.g 



XT- 



Statistical Background Subtraction 2.10 

Fix a = -0.6 2.40 

Fix a = -0.8 2.40 

Fix a = -1.0 2.80 

Fix a = -1.12 2.80 

Cut galaxies with probability < 20% 2.50 

Cut galaxies with probability < 30% 2.40 

Cut < 30% with no LF correction 2.40 

Cut galaxies with probability < 40% 2.15 

Cut galaxies with probability < 50% 2.40 
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0.10 
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0.18 
+0.16 

0.18 
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0.91 
1.33 
1.50 
0.94 
0.79 
1.17 
1.50 
1.56 
1.32 
2.72 



TABLE 5 
Impact of Model Choice. 



Model Set IMF Metallicity 



z f 



A /./ 



M05 


Kroupa 


Solar 


2.40 
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Solar 
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Fig. 12. — Best fitting formation redshift for fitting systemat- 
ics and model systematics as given in Tables [4] and \E\ The solid 
and dashed lines represent our chosen fiducial fit and errors of 
2.40+q;j|. We note that both the a = -0.8 data point and the 
P > 30% data point represent the same fit to the same data (our 
fiducial fit). It has been included twice to illustrate the dependence 
of our results on both a and the cut probability. 

later formation redshift. As can be seen in Figure HH our 
data and toy model could allow for formation redshifts 
as late as Zf ~ 2.1 at la, or Zf ~ 1.9 at 3a. 

5.2. High Redshift 

We noted in Section B~2"1 that our best fit model deviates 
(in a statistically significant way) from the two highest 



Fig. 13. — The same as Figure H"2l except for the best fit model 
normalizations (Mi 6 ) and errors for the various model systematics. 
The solid and dashed lines represent our chosen fiducial fit with 
-24.47 ±0.04. We note that both the a = -0.8 data point and the 
P > 30% data point represent the same fit to the same data (our 
fiducial fit). It has been included twice to illustrate the dependence 
of our results on both a and the cut probability. 



redshift data points. There are two possible explanations 
for this phenomenon: either a systematic error is impact- 
ing the high redshift data or this is a signature of galaxy 
assembly. We have already discussed a number of possi- 
ble systematic errors in both the fitting process (Section 
3.5p and in our models (Section l4.3p . We find that the 
disagreement between the models and data at high red- 
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Assembled Fraction by z = 
0.87 o.r 




Fig. 14. — 1, 2, and 3<r confidence regions for the formation 
epoch versus 7, which parameterizes the rate of mass assembly in 
low redshift galaxies. In this model galaxies are assumed to lose 
mass at the rate of (1 + z) 1 out to higher redshifts. The bottom 
axis shows the values of 7 fitted, and the top axis shows the fraction 
of the final galaxy mass that is assembled at z = 1.0 for a given 
value of 7. The solid circle shows our best fit of zt = 2.40, 7 = 0. 

shift is a consistent feature found for all the variations 
of model and fitting parameters listed in Tables 2] and [5j 
Moreover we find that this disagreement remains for all 
of the variations of the IMF slope discussed in Section 
14.3.31 and is robust against changes in redshift bin sizes 
and locations. Evolution in a is also unlikely to explain 
this result, as it would require the LF to be significantly 
steeper at high redshift. 

A more subtle systematic uncertainty could be the 
cause of this high redshift deviation. For instance, our 
photometric redshifts begin to degrade at z ~ 1.5, where 
the models and data begin to disagree. Also, the most 
distant spectroscopically confirmed cluster is at z=1.48, 
which is near the point where the models no longer match 
the data. It is possible that the false positive rate for 
our cluster search grows rapidly at high redshift, at too 
high of a redshift for the problem to be detected with 
our data. However it is not clear how either of these 
two possibilities would cause the data to grow system- 
atically fainter by ~1 magnitude. We have performed a 
simple test using our highest redshift data bin that still 
matches the models (z = 1.24). We scattered all the 
photo-z probability distributions to simulate what would 
happen if the photo-z errors were underestimated by a 
factor of ~ 3, and we have moved galaxies from random 
redshifts into the bin to simulate the presence of catas- 
trophic failures in the photometric redshifts. We added 
enough such that at least 5% of the galaxies in the bin 
are catastrophic failures. Despite these drastic changes 
the fitted value of m* for the z = 1.24 bin only scattered 
within its error bars. This suggests that random errors in 
the photometric redshifts cannot be responsible for such 
a large systematic offset. 

Systematic errors in the photometric redshifts are more 
likely to cause this discrepancy, but do not seem adequate 
to account for it. If the photometric redshifts are system- 



atically underfitting the redshift of high-z galaxies, then 
the galaxies will appear fainter than expected because 
the selected galaxy population will be at a higher red- 
shift than claimed. This would also explain why there is 
no apparent discrepancy found in the colors of the high 
redshift data (see Figure EJ, since there is little color 
evolution in the models for z > 1.5. However in order to 
explain the ~ 1 mag offset, the galaxies in the z = 1.5 
bin would have to have a mean redshift of z > 2.0. Such 
a large error seems unlikely, especially since there is no 
evidence of z — 1.5 galaxies ending up in the z — 1.24 
redshift bin. Similarly, we see no reason why an increas- 
ing false positive rate would cause a systematic shift to 
fainter magnitudes for our data. Finally we note that 
in Figure [4] the results of our statistical background sub- 
traction also show evidence for this same disagreement 
at high redshift. This is particularly significant because 
photometric redshifts do not factor into the statistical 
background subtraction, and because it should in gen- 
eral be less prone to systematic errors than our photo-z 
selection method. We conclude that there is no strong 
evidence that systematic errors cause the disagreement 
between data and models at high redshift. Further work 
is required to investigate the cluster LF in detail at high 
redshift, and for now we will discuss the implications of 
our result if these deviations are real. 

The discrepancy at high redshift is easiest to explain 
by ongoing mass assembly. One point in favor of growth 
through mergers is the lack of color evolution, relative 
to the models, for the high redshift clusters. As can be 
seen in Figure the color of an m* galaxy matches the 
models over the entire redshift range of our sample, de- 
spite a ~ 1 mag deviation in apparent magnitude at high 
redshift. As long as the galaxy assembly process doesn't 
appreciably change the luminosity weighted mean age of 
the stellar populations the color of the clusters will con- 
tinue to match the models, even while the luminosity 
increases due to the increase of mass. 

If we assume that all of the deviation at high redshifts 
comes from galaxy assembly and that there is no sig- 
nificant assembly in the low redshift bins, then we can 
estimate the fraction of the final galaxy mass that is as- 
sembled in our last two bins. We simply assume that the 
fractional decrease in luminosity relative to the models 
correlates directly with the fraction of mass "lost" to as- 
sembly. This implies that the galaxies grow by a factor 
of ~ 2 - 4 from 1.3 < z < 2.0. 

Ours is the first survey to track the evolution of the 
luminosity function of cluster galaxies to such high red- 
shifts, so there is little in the literature to which we 
can directly compare our result for clusters. This re- 
sult does howeve r appear to be at odds with what is 
found in the field. Illbert et al.l (J2010) for example exam- 
ine the stellar mass function of quiescent field galaxies 
over this same redshift range, finding little evolution in 
M* (the characteristic mass of field galaxies) at high red- 
shift, 1.0 < z < 2.0. A number of other studies have also 
found that the high mass en d of the field population is 
full y formed by z ~ 2 (e.g . iPerez-Gonzalez et alj|2008l 
and iMarchesini et all l2009f) . It would be surprising for 
the cluster galaxies to still be assembling when the mas- 
sive field galaxies are in place; however we are unable to 
identify any systematic error that would qualitatively al- 
ter our result. Our z j 1.3 results indicate that the z ~ 2 
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era was likely an active one in cluster galaxy formation. 
Further investigation of the properties of galaxies in rich 
environments at z ^ 2 is needed. 

6. SUMMARY AND CONCLUSIONS 

We measure the evolution of the 3.6 and 4.5/^m lumi- 
nosity functions for massive cluster galaxies using results 
from the SDWFS. We find that for z < 1.3 our data 
are consistent with passively evolving stellar populations 
with a mean formation redshift of z = 2.40JIq ]J assuming 
a Kroupa IMF and a = —0.8. We investigate a number 
of possible systematic errors that could be biasing our 
best fitting formation redshift, and summarize the most 
important here: 

1) We can only measure a in the lowest redshift bin 
of our data, and so must fix a in our fits. Our choice 
of a introduces an uncertainty in our formation redshift 
of Azf =io'o5 - Moreover fixing a means that we are 
not sensitive to any evolution in a. However our final 
formation redshift should not have a large dependence 
on any evolution in a because our ability to measure 
the location of the bright end of the LF is independent 
of what the faint end might be doing. As long as we 
choose a consistent value for a at all redshifts we can 
accurately measure any relative change in the bright end 
of the LF as a function of redshift, and therefore we can 
still properly measure the formation redshift of massive 
galaxies. 

2) Our choice of model set introduces a systematic un- 
certainty of Azf =lo45 in to ou r results if we consider 
ICB071 and lBC03l relative to MM . 

3) The uncertainty in the slope of the IMF near 1M@ 
introduces a bias into the evolution of the luminosity 
function on the order of ~ 0.4 mags per unit redshift. 
This bias is large enough that a sufficiently flat IMF can 
result in arbitrarily early formation redshifts. Given that 
Ivan Dokkuml ((2008) finds evidence for a flat IMF based 
on joint constraints from the color and mass-to-light evo- 
lution of their cluster galaxies, a flatter IMF is certainly 
plausible. This degeneracy between the formation red- 
shift and IMF is best broken by a combination of color 



and luminosity evolution (as was done by Ivan Dokkuml 
2008), a possibility which we intend to investigate with 
this cluster sample in future work. 

We generate a simple toy model to explore the growth 
of massive galaxies at low redshift, and find that our data 
are best fit with no additional growth through assembly 
for z < 1.3. However even for the simple model we gener- 
ate, our data are still consistent with some growth at low 
redshift. We conclude that even though our galaxies are 
well matched by passively evolving stellar populations at 
low redshift, this does not mean that there is no mass 
assembly of low redshift galaxies. 

Our best fit model disagrees significantly (~ 5a) with 
the highest redshift data (z — 1.3 — 2.0). We cannot 
completely rule out the possibility that this deviation 
is caused by systematic uncertainties such as errors in 
the photometric redshifts at high redshift or a high false 
detection rate of clusters. However we explore the impli- 
cations of what this deviation would mean if it is real, 
and conclude that it could be a signature of ongoing mass 
assembly. 
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